MobileLLM is an autoregressive language model optimized for device-side applications. It adopts an optimized Transformer architecture and integrates key technologies such as the SwiGLU activation function, deep and narrow architecture, embedding sharing, and grouped query attention, providing powerful language processing capabilities on resource-constrained device sides.
Natural Language Processing
TransformersEnglish